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ABSTRACT 

Using  a  simple  martingale  representation  result  a  conditional  version  of  the  Malli- 
avin  calculus  is  developed.  Under  Hormander’s  conditions  on  the  coefficient  vector  fields 

^'  weehow  the  filtering,  smoothing  and  prediction  problems  ^have  C°°  density  solutions.  ■- 

C>:  :  ■  ■>  -o  i ,  »  " 

1.  INTRODUCTION.  Following  Malliavin’s  remarkable  work  [6]  there  have  been 
other  treatments  of  the  Malliavin  calculus,  including  those  of  Bismut  [l],  Stroock  [8] 
and  Norris  [7].  A  particularly  readable  account  can  be  found  in  the  paper  of  Zakai  [9j. 
In  [2]  Bismut  and  Michel  developed  a  conditional  version  of  the  Malliavin  calculus  to 
show  the  existence  of  a  conditional  density  in  filtering  and  smoothing  problems.  Using 
a  simple  and  natural  expression  for  the  integrand  in  a  stochastic  integral  the  authors  [4] 
have  been  able  to  give  an  elementary  proof  of  the  existence  of  a  density  for  a  diffusion, 

'Research  partially  eupported  by  the  Air  Force  Office  of  Scientific  Reeearch,  United  State*  Air  Force,  under 
grant  AFOSR-83-0332  and  European  Office  of  Aeroipace  Research  and  Development,  London,  England 
3Re»earch  partially  eupported  by  NSERC  Grant  A  7934 

The  author*  are  grateful  to  Provecior  E.  Pardoux  for  pointing  out  an  error  in  an  earlier  veraion  of  the 
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under  Hdrmander's  conditions  for  the  coefficient  vector  fields.  The  homogeneous  chaos 
expansion  of  the  random  variable  is  also  obtained  in  [4].  The  objective  of  this  paper 
is  to  present  a  conditional  version  of  the  results  of  [4]  and,  following  the  exposition  of 
Zakai,  simplify  some  of  the  results  of  Bismut  and  Michel. 

In  this  paper  the  following  system  of  stochastic  differential  equations  is  considered: 

dx  =  Xo[x,y)dt  +  Xi(xty)dwl  +  Xj(x,y)dB’  +  Xj(x,y)h’(x,y)dt. 
dy  =  Y0{y)dt  +  Yj{y)dB>  +  Y](y)h]{x,y)dt. 

Here  w  —  (u/1,  ...,  u/n)  and  B  =  (J91,  Bn)  are  independent  Brownian 

motions.  The  process  x  represents  the  unobserved  signal  process,  while  y  represents  the 

observation  process.  If  {Y*}  is  the  right  continuous,  complete  filtration 

generated  by  {y*}  then  the  filtering  problem  discusses  E\xt\Yt},  the  prediction  problem 

discusses  E[xt\YB\  when  s  <  t,  and  the  smoothing  problem  discusses  E[xt\Yt]  when 

s  >  t. 

Using  the  simple  martingale  representation  result  of  [4]  a  conditional  version  of 
the  Malliavin  calculus  is  developed  in  section  4.  Suppose  T  >t  and  let  c  be  any  smooth 
function  on  Rd  with  bounded  derivatives  of  all  orders.  In  section  5  we  show  that  if  the 
inverse  of  the  conditional  Malliavin  matrix  Af  belongs  to  U>{ fl)  for  all  p,  1  <  p  <  oo, 
then 

\E[-Q^{xt)\YT]\  <  K(y)  jsup  |c(x)| 

for  all  multi-indices  a  =  (ati,  ...,  at)  where  K(y)  is  a  Yj’-measurable  random  variable 
which  is  finite  a.s.  The  delicate  and  technical  sufficient  conditions  for  the  integrability 
of  M~p  are  not  discussed  in  this  paper. 
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This  inequality,  using  simple  Fourier  analysis,  implies  that  the  random  variable 
xt  has  almost  surely  a  conditional  density  given  Yt,  which  is  infinitely  differentiable. 
Using  Jensen’s  inequality  we  can  immediately  deduce 

l^[^(x*)l^]l  <  K'{y)  sup  |c(z)| 

dx  x€Ri 

where  s  >  t  or  s  <  t.  Therefore,  the  smoothing,  filtering  and  prediction  problems  for 
xt,  given  Ys,  have,  almost  surely,  smooth  conditional  density  solutions. 


2.  STOCHASTIC  FLOWS 

We  recall  in  this  section  the  properties  of  stochastic  flows,  and  in  particular  those 
relating  to  ‘lower  triangular’  systems  obtained  by  Norris  [7].  Let  tvt  —  ( tv /,  . ..,  tu"), 
t  >  0,  be  an  n-dimensional  Brownian  motion  on  (Q,  F,  P).  Write  {Ft}  for  the  right 
continuous,  complete  filtration  generated  by  w.  Suppose  Xo,  X\ ,  Xm  are  smooth 
vector  fields  on  [0,oo]  x  Rd,  all  of  whose  derivatives  are  bounded.  Then  from  Bismut 
[l],  or  Carverhill  and  Elworthy  [3],  we  quote  the  following  result: 

THEOREM  2.1.  There  is  a  map  £  :  H  X  [0,oo)  x  (0,oo)  x  Rf1  — ►  K1  such  that 

i)  for  0  <  s  <  t  and  x  €  Rd  £tlt  (x)  is  the  essentially  unique  solution  of  the  stochastic 
differential  equation 

dt'jt  (*)  =  X0(t,  fa  (x))<ft  +  Xi(t,  £tit  ( x))dw\  (2.1) 

with  (*)  =  x. 

(Note  the  Einstein  summation  convention  is  used). 

ii)  for  each  w,  a,  t  the  map  £t<t  (•)  is  C°°  on  R^  with  a  Erst  derivative,  the  Jacobian, 

=  Dt,t ,  which  satisfies 

dD»Jt  =  ^jy-(t,  ( x))D,<i  dt  +  ^y(t,  Cajt  ( x))D*,t  dw't  (2.2) 

with  initial  condition  Dt,t  =  I,  the  d  x  d  identity  matrix. 

REMARKS  2.2.  Note  that  (2.2)  is  obtained  formally  by  differentiating  (2.1).  In  fact 
equations  for  higher  derivaties  -gpt  are  obtained  by  further  differentiation.  However, 
if  we  consider  the  enlarged  system  given  by  (2.1)  and  (2.2)  the  coefficients  are  not 
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bounded,  because  of  the  linear  appearance  of  D,tt  on  the  right  of  (2.2).  However, 
Norris  [7]  has  extended  the  results  of  Theorem  2.1  to  such  systems.  To  state  Norris' 
results  we  first  define  a  class  of  ‘lower  triangular’  coefficients. 


DEFINITION  2.3.  For  positive  integers  a,d,di,  ...,  dk  write  Sa (di ,  ...,  d*)  for  the  set 
of  Xe  C°°(Rd,Rd)  of  the  form 


X(z)  = 

f  ^(x1)  ^ 

xWfz1,!2) 

for  x  = 

s' 

"h  •  ■  * 
s — _ . 

^  X<*>  (x1,  x2,...,  xk)j 

WJ 

where  Rd  is  identified  with  Rdl  x  •  •  •  x  R** ,  xJ  G  Rd>  and  the  X  satisfy 

OTsMO  =  8UP  (  SUP  77-,^^  V  sup  |DyArW(x)|) 

seif1  MK»  <N  (1  +  1*1)  I<y<*  / 

<  oo  for  all  positive  integers  N.  (2.4) 

Write  S(dlt  . . . ,  dk)  =  USa(<*l,  . . . ,  d*). 

a 

REMARKS  2.4.  Note  equations  (2.1)  and  (2.2)  can  be  considered  as  a  single  system 
whose  coefficients  are  not  bounded,  but  are  in  S[d,  d2).  The  final  supremum  on  the  right 
of  (2.4)  implies  the  first  derivatives  of  X^1)  are  bounded,  as  are  the  first  derivatives  Dj 
in  the  ‘new’  variable  xJ  of  X^  (x1,  ...,  xJ).  This  means  xw  is  allowed  linear  growth 
in  xJ,  a  situation  illustrated  in  (2.2).  We  quote  from  Norris  the  following  extension  of 
Theorem  2.1. 

THEOREM  2.5.  Let  Xo,  Xj, ...,  Xm  G  Sa(di, ...,  d*).  Then  there  is  a  map  $  :  fl  x 
[0,oo)  x  (0,oo)  x  R?  —*■  R?  such  that 
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i)  for  0  <  s  <  t  and  x  G  Rd  <f>(ut,8,t,x)  is  the  essentially  uniques  solution  of  the 


stochastic  differential  equation 


dxt  =  Xq  (xt)dt  +  Xi(xt)dw\ 


with  x,  =  x. 


ii)  for  each  u>,  s,t  the  map  <f>{u,s,t,x)  is  C°°  in  x  with  derivatives  of  all  orders  satisfying 
stochastic  differential  equations  obtained  from  (2.5)  by  formal  differentiation. 


sup  e  \  sup  \DN 4>{uj,  «,u,x)|,’] 
\x\<R  L«<u<t  J 


^  C(Pj a, TV, dj ,  ...,  djt, ot, || Jfo||s(a,AQ  »  •••>  ||-T^»||s(a,N)  )•  (2-6) 


REMARKS  2.6.  Norris  proves  Theorem  2.5  by  induction  on  j.  Write  (2.5)  as  a  system 
of  stochastic  differential  equations  for  j  =  1,  . . . ,  k 

dxj  =  (xj ,  . . . ,  x{)dt  +  XW  (xj  j ...  f  xj)dw{ 


x*  =xt  e  Rd> . 


Suppose  the  result  is  true  for  1  =  1,  ...,j  —  1  and  write  Xt-  (u>, s,t, x3) 

=  X&  (xj (w),  ....  xj'1  (w),x>).  Then  (2.7)  can  be  written  in  the  form 


dxj  =  Xo(s,t,xj)dt  +  Xi(s,t,xj)dwt 


and  Theorem  2.1  applied.  The  difficult  step  is  establishing  the  result  for 


A 


j  =  1.  However,  this  follows  by  a  stopping  time  argument,  which  is  essentially  the 


(  >  *  :  M 


►  >•  IV  i * 


method  by  Bismut  [1].  Using  the  notation  of  Theorem  2.1  consider  the  process  V 
defined  by 

dVttt  =  -  Vtjt  (^(t,  (x))  -  Ut  (*))  dt 

^  1=1 

-  V.jt  t,,t  {x))dw't,  (2.8) 

with  Vt  t  =  I.  Then  by  applying  the  Ito  rule  we  see  d(Dtyt  Vtt )  =  0,  while  De<t  =  A 
the  d  x  d  identity  matrix.  Therefore,  Vitt  =  .  By  applying  Theorem  2.5 (iii)  to 

the  system  given  by  equations  (2.1),  (2.2)  and  (2.8)  we  have  \D*t*  \  =  sup  \DttU  |  and 

•<u<t 

1 17*  I  =  sup  |V#1*  I  are  in  1/(0)  for  all  p  <  oo.  Finally,  for  0  <  s  <  t,  recall,  by  the 
’  *<«<< 

uniqueness  of  the  solution  of  (2.1): 

£o,t  (x0)  =  6^  (6v  (xo))  =  (»jt  (x),  if  x  =  fa*  (xo).  (2-9) 

Differentiating  (2.9): 


=  Dtf  Do,, 


(2.10) 
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3.  MARTINGALE  REPRESENTATION. 

Consider  a  stochastic  differential  system  with  coefficients  in  some  set  S ,  as  dis¬ 
cussed  in  Theorem  2.5,  and  let  £o,t  (x,0)  be  its  stochastic  flow  solution.  For  some  T  >  0 
consider  a  real  valued  differentiable  function  c  for  which  the  random  variable  e({o p  (xo)) 
and  the  components  of  the  gradient  C£(£o,t(xo))  are  integrable.  Let  Mt  be  the  right 
continuous  version  of  the  martingale 

£[c(£o,r  (zo))|F*]. 

Then  we  have  the  following  representation  result,  (see  [4]),  whose  proof  we  give  for 
completeness. 

THEOREM  3.1.  For  0  <t<T,  Mt  =  -E[c(£o,T  (*o))1  +  /0*  'u(s)dwlt  where 
Tfi(s)  =  E[c((fa  (xo))D0,t  X,(s,  fa  (x0)). 

PROOF.  It  is  well  known  that  Mt  has  a  representation 

Mt  =  M0  +  f  'ji(s)dxv\ 

Jo 

for  some  predictable  integrands  7,-.  Because  the  process  fa  (x0)  is  Markov 

M  =  £?[c(foj(x0))|Ft] 

=  ^K^(x))|Ft) 

=  *|e(€*r(*))] 

=  V(t,x),  say,  where  x  =  fa  (x0). 


(3.1) 


(3.2) 
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By  Theorem  2.5  and  the  chain  rule  c(Cty  (x))  is  differentiable,  in  fact  smooth,  in  x. 
The  differentiability  of  Ett*  [c(6,r  (*))]  in  *  can  be  established  by  writing  the  backward 
equation  for  Ct?  (x),  as  in  Kunita  [5].  Consequently,  applying  the  Ito  role  to  V (t,  z), 
with  x  =  (zq)  we  have 


r *  qv 

V{t,toA* o))  =  V(0,xo)  +■■]  (  ^7  +^)da 

dv 

"fci**  to,*  M)Xi(s,  Co,*  (z0))dt<;i  (3.3) 

d  t  -  (i  TfX  r\  J 

where  L  =  Xo^r  +  5  E  (J2  X^Xl)  5^5^.  •  By  the  uniqueness  of  the  decompo¬ 
sition  of  special  semimartingales,  comparing  (3.1)  and  (3.3),  we  must  have,  (as  is  well 


and  'u(s)  =  §£(s,  Co,  (x0))X,(«,  (so))-  from  (3.2)  ^  =  E[c(((*jr  (x))Dtp  \Ft]  so 

by  (2.10)  7,(«)  =  £[ce(6),r  (*o))  A),T  *<(«,  Co,S  (*o))- 


COROLLARY  3.2.  The  result  extends  immediately  to  vector  (or  matrix)  functions  c 


COROLLARY  3.3.  Note  in  particular 

fT 

c(eoj’(xo))  =  £[c(eo^(xo))]+  /  £M£o,r  (xo))D0p  |F.l£)0-i  X<(s,  Co,s  (*o  ))<*«>;  (3.4) 

Jo 

LEMMA  3.4.  Ft  is  generated  by  the  set  of  stochastic  integrals  of  the  form  7 i(s,wt)dw\, 

where  the  integrands  7 ,■  are  smooth  functions  of  s  and  w,  at  time  s,  with  bounded 


derivatives  of  all  orders. 


PROOF.  a{wt}  is  generated  by  g(wt)  for  g  £  C%°(Rd).  If  we  apply  Theorem  3.1  to  the 
process  Wt,  so  wt  =  x  +  {wt  —  to,)  where  x  =  wt,  the  Jacobian  is  the  identity  I  and 

=  E,jt[gw{wt)\  =  7M 

where  7(0;,)  =  («;,),  . . . ,  7m  (to,))  =  EtiW,  ((ffwi  (to,),  ...,g^  (to*))].  Therefore,  g(wt)  = 

F[y(t«t)]  +  Jq  'fi(wt)dwxt  where  the  7,-  £  C£°(Rd).  Consequently  a(wt)  is  generated  by 
stochastic  integrals  of  this  form.  Allowing  the  integrands  to  depend  on  s  we  that  Ft, 
which  is  generated  by  wt,  for  s  <  t,  is  generated  by  stochastic  integrals  of  the  form 

/  'ii{s,wt)dw\ 

Jo 

where  7,-  £  C“([0, 00)  x  Rn). 

REMARKS  3.5.  So  far  we  have  considered  an  n-dimensional  Brownian  motion  w  — 

(to1,  ...,  wm)  and  a  state  vector  x  £  Rd.  Consider  now  a  larger  system:  suppose 

B  =  (B\...t  Bn)  is  an  n-dimensional  Brownian  motion,  defined  on  a  probability  space 

(fl,  F,  P),  which  is  independent  of  w.  Write  {Ft}  for  the  right  continuous,  complete 

filtration  generated  by  B ,  and  {Gf }  for  the  right  continuous,  complete  filtration  on 

fl  x  fl  generated  by  Ft  x  Ff.  Consider  a  second  state  vector  y  £  Rv  and  a  stochastic 

^  ^ 

differential  system  defined  on  (fl  X  0,  F  X  F,  P  x  P)  by  the  equations 

dxt  =  Xo{xt,yt)dt  +  Xi(xt,yt)dw't  +  X}(x  t,yt)dB} 

dyt  =  Y0{yt)dt  +  Yj(yt)dB}  (3.5) 

with  (x(0),y(0))  =  (xo,yo)  £  Rd  x  Rp.  We  shall  suppose  the  coefficient  vector  fields 
Xq,  ... ,Xm ,  Yn  are  such  that  the  coefficients  of  (3.5)  belong  to  the  space  5, 
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so  that  Theorem  2.5  can  be  applied.  Note  that  in  (3.5)  the  process  y  is  not  influenced 
by  the  process  x. 


NOTATION  3.6.  Suppose  (x,y)  €  Rd  x  Rp  is  the  state  of  the  system  (3.5)  at  time  s. 
We  shall  denote  the  solution  flow  of  (3.5)  for  t  >  s  by  the  map 


(x,y)  -►  (xs,t(x,y),ye>t  (y)) 


The  Jacobian  of  this  map  looks  like 

dix»,t  (z,y),y»,t  (y))  _ 
d(x,y) 


dy 


Write  De>t(x,y)  for  the  ‘partial*  Jacobian  ~  ^  •  The  existence  of  the  large  Ja¬ 

cobian,  and,  therefore,  of  its  components,  including  Dg  t ,  is  given  by  Theorem  2.5. 

As  in  [2],  we  now  introduce  a  new  measure  on  (Cl  x  Cl,  F  X  F )  by  a  Girsanov 
change  of  density. 


NOTATION  3.7.  Suppose  h(x,y)  =  (A1  (x,  y),  . . . ,  hn(x,y))  is  a  smooth  function  in 
C°°(Rd+p  ,Rn)  with  bounded  derivatives  of  all  orders.  Define  the  real  valued  process 
L  on  n  x  H  x  [0,  oo)2  x  Rd  x  R?  by 

L»,t[x,y)  =  exp  |  j  h’(xs>u  (x,y),ylyV  (y))dB{ 

l . n ..  r*  % 

~2^j  si¬ 


lken 


dL*,t  (x>  y)  -  L*,t  (x,  y)(h’(xttt  (x,  y),yt<l  (x,  y))dB{ 


yw-\  '‘s'i 

ypPPP 


with  Lt)t{ x,y)  =  1,  so  L  is  a  {Gt}  martingale.  Furthermore,  Lqj  =  sup Lo,«  is  in  every 

«<i 

space  IP  (Cl),  1  <  p  <  oo.  Because  h  is  bounded  we  also  have  that  (LqJ  )*  =  sup(Z/0^) 

V  <t 

is  in  every  IP (fl),  1  <  p  <  oo.  We  could  consider  the  flow  given  by  the  combinded 
system  (3.5)  and  (3.7).  However,  for  the  moment  note  that  for  0  <  s  <  t 


Lo,t{x o,Po)  =  Lo,»  {xo,yo)L,tt  (x,y) 


(3.8) 


so  writing  L  =  Lotl  (xq,j/o)  we  have 


dLo,t  T  ,  ^ 

~ir  = 


and 


Lr  (T  i  r  dL*t  n 

dx0  ~  3xo  +  ax  ^ 


with  a  similar  equation  for 


(3.9) 


DEFINITION  3.8.  Define  a  measure  P*  on  (0  x  Q,  P  x  P)  such  that  its  restriction  to 
G*  is  given  by 

dPh(u/,w)  =  L0,t  ( x0,y0)dP{w )  x  dP(w). 

Then  Girsanov‘s  theorem  states: 


THEOREM  3.9.  Under  Pk  the  process  B'  is  an  n-dimensional  Brownian  motion  inde¬ 


pendent  of  to,  where 


Therefore,  under  the  measure  P&  the  process  (xt,t ,  y$,t )  is  the  solution  of  the  stochastic 
differential  equation 

dxtJt  =XQ  (xttt ,  y,jt  )dt  +  Xi  (xt)t ,  yM  )dw\  +  X,  (xt>t ,  y,it  )dB[3 

+  W  ix*,t .  y*,t  )dt 

dy»jt  =Yo{yt<t  )dt  +  Yj(yttt  )dB[3  +  Yj{ytJl  )h3  (xtit ,  yt>t  )dt  (3.10) 

with  ( xt>t,yStt )  =  (x,y)  eRd  *RP. 

REMARKS  3.10.  The  system  (3.9)  provides  a  natural  setting  in  which  to  discuss  fil¬ 
tering,  smoothing  or  prediction  problems.  The  process  x*  represents  a  signal  which  is 
not  observed  directly.  Instead  one  observes  the  process  y*  which  is  influenced  by  xt 
through  the  process  h(xt,yt).  Write  { Yt }  for  the  right  continuous,  complete  filtration 
generated  by  y,  and  Ek  for  expectation  under  P\.  The  filtering  problem  discusses 

Eh[xt\Yt], 

the  smoothing  problem  discusses 

Eh[xt\Yr\,  where  t  <T, 
and  the  prediction  problem  discusses 

Eh[xt\YT\,  where  t  >T. 

In  this  paper,  using  the  techniques  of  the  Malliavin  calculus  ,  we  show  in  the  filtering, 
smoothing  and  prediction  cases,  that  the  conditional  distribution  of  x<  has  a  smooth 
density. 


4.  INTEGRATION  BY  PARTS 

Suppose  0  <  t  <  T  and  let  Uoy  (£>)  be  an  Fy  measurable  random  variable  of  the 
form  discussed  in  Lemma  3.4,  that  is 

Uo, t(u)=[  (4.1) 

Jo 

where  7/  6  C£°  ([0, 00)  x  Rn  for  1  <  j  <  n.  Consider  the  system  given  by  (3.5),  (3.7) 
and  (4.1)  on  (fl  x  fi,  F  x  F,  P  x  P)  : 

dxg't  —  Xo{x»f  »  )dt  +  X{  {%t,t  1  Vtf  •+-  ti/t^)dB^ 

dy»,t  =  Y0  (y4ft )  dt  +  Yj  (ylit )  dB’t  (4.2) 

dLgtt  =  Lffh?  (x,,$  ,ytlt)dBf 

dUgtt  =7,  (t,Bt)dB( 

Then  Theorem  2.5, with  (xM ,  yta ,  Lta ,  Ut<t )  =  (x,y,l,0),  can  be  applied  to  (4.2)  and 
we  can  consider  the  associated  stochastic  flow.  Note  Ut<t  does  not  involve  x,y,  or  L, 
and  if  U0i9  =  U  then 

U0,t  =U  +  J*  ~i}(s,Bt)dB{.  (4.3) 

Also,  if  L  =  I/o,,  ,  from  (3.8) 

Lojt  =  LLtJt(s,y).  (4.4) 

THEOREM  4.1.  Suppose  0  <  t  <  T  and  let  c  be  a  C°°  function  on  jR?  with  bounded 

derivatives  of  all  orders.  Then  for  any  square  integrable  predictable  process  u(s)  = 
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E[Uop  Lqjc{xq^  (zo.yo))  f 

Jo 

m  ,t 

=  ^E[Uoy'Lo^cs{xoji{xo,yo))Dojt  /  A^.X;*(«)u,-(«)<h] 

,=i  jQ 

+  E[U0p  Lop  Lq$>  c{x0ji  (x0,  yo)) £>o,J  X,(«)u,(s)<k] 
“  Y1  EiUo?  Lop  c(xQit  (x0,yo))  L^'l  ~J^~Do,l  X,(s)u,(s)ds]. 


PROOF.  First  recall  the  derivation  of  Theorem  3.1  and  write  for  0  <  s  < 


V{s,x,y,L,U)  =  E\U0p Lop  {xQ,yo)c{xQ<i{xo,yQ))\G.] 

=  E[{U  +  U,p  )LL$p  (x,y)c(x,t*(x,y))|G,] 

=  Et,%,y,Lfj  |(17  +  Utp)LL,p(x,y)c(x,ft(s,y))]. 


(4.5) 

t  <  T 

(4.6) 


The  martingale  representation  result  is  obtained  by  writing  down  the  Ito  formula  for  V, 
and  the  derivatives  of  V  are  found  by  differentiating  the  conditional  expectation  (4.5) 
in  x,y,L  and  U.  Note  that  for  s  >  t  the  derivative  of  e(x<f*  (x,y))  in  x  is  zero.  We, 


therefore,  have 


UopLop  (xo,yo)c(^  (20, yo))  =  £[^o,r  ^,r(a:o»ybMxo,t  (*o.yo))] 

+  /  Wo?  Lo,t  Cx(xo^  (*o,yo))Atf  \G.)Doi  {X{dw't  +  X,dB’) 
Jo 

+  J*  E[U0?Lorcx(xos  (xo,yo))]d^Xl^,y)) ) Gt]YjdB’ 

+  f  E[Uop  Ltpc(xoj  [xo,yo)\Gt\hj dB{ 

Jo 


+ 


T 

f  ElUofL^^ix^cixosixo'yoWGMXidw*  +  X,dB’) 
Jo 


dx 

dLtp 


+  J  E[U0p  L-^-  (x,y)c(x0tt  (x0,yo))| G,)YjdB{ 

+  [  E[Utp LoFc(xof[xotyo))\Gt]'i}dBi. 

Jo 


(4.7) 


T 

Taking  the  product  of  (4.7)  with  fQ  ut-(s)dw J,  because  w  and  B  are  independent  under 


P  x  P,  we  have 


rT 

E[U0p  Lo?  c(x0,t )  /  u«‘(tf)du/*] 

Jo 

m  -t 

=  E[U0J  Lop  e*(x0, t  )Dos  /  Xt(a)u, (s)d«] 

.=1  Jo 

+  £>[%x  £«(*,,.)/’  !^(x, »)*(,)*]. 


(4.8) 


ftom  (3.9)  jPoJ  -  L,x (x,y)D0 J .  Substituting  in  (4.8)  the  result 


follows. 


NOTATION  4.2.  Write  *  for  the  transpose.  Furthermore  write  Bop  =  /J  {Dq]  Xi(s))* dw\, 
^oj  =  Y  fo  Eq*  ^£^-DojXi{*)Xi{a)*{Do*  )*da  and  recall  the  Malliavin  matrix,  (l], 

1=1  vXQ 

[4],  (which  here  is  a  ‘partial’  Malliavin  matrix  in  the  X,-  vector  fields): 

M°?  ='£,[  D£X<(‘)X<W(Dvir<U. 
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COROLLARY  4.3.  We  then  have  the  special  case  of  Theorem  4.1  obtained  by  taking 

«<(•)  =  (£#*(>))•  : 

E[Uop  Lop  e(xo,i  ]  =  E[Uop  Lop  cx (xo.t  )Dott  Mq ,t  ] 

+  E[Uop  Lop  Lqj-  c(x oj  )  g  Efop  ]•  (4-9) 

-  E[Uop  Lop  e(x0>f  )Ao,r  ] 

COROLLARY  4.4.  Equation  (4.9)  is  still  true  for  vector,  (or  matrix),  functions  c. 

REMARKS  4.5  The  gradient  cs  of  c  occurs  in  only  one  term,  so  (4.9)  is  an 
‘integration  by  parts’  formula.  Suppose  g  is  a  second  smooth  function  with  bounded 
derivatives  of  all  orders.  Applying  (4.9)  to  the  product  c(x0ft)ff(xo^)  we  have 

E[U0p  Lop  c(x0tt  )g(x0tt  )Rop  ] 

=  E\Uop  Lop  (c,(xo^  )g{xott )  +  c(xo,t  )gx  (xo^  ))A>,t  Mqjl  ] 

+  E[Uop  Lop  Lop  c(xqj  )g(zoj )  -^—Mop  ] 

-  E[Uop  Lop  c(xo,t  )j?(xo^ ) hop  ].  (4.10) 

From  Lemma  3.4  the  random  variables  Uop  generate  Ft  so  (4.10)  can  be  written 

E[Lop  c(x0^  )y(*o^  )Rop  i-Fr] 

=  E[Lop  (c,  (xo^  )ff(xo,t )  +  c(x0^  )<7*  (x<v  ))  Am  Mojt  |Fr] 

+  E\Lop  Lop  c(x 04  )g{zoj )  \Ft } 

-  E[Lop  c(x0,t  )g(z 04  )A0p  \Ft ]• 
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Under  P  x  P ,  Yr  C  Ft  so 


E[Loj  c{xQji  )g(xQtt  )Ro?  \Yt] 

=  E[Lop  (c,  (z0,<  )g[xojt )  +  c(z0><  )g%  (zoj  ))D0j  M0>t \Yt) 

+  E[Lox  Lo?  cixojt  )g{*Oji )  Mox \YT] 

-  E[Lof  c(xoit)g(xojt)Aox  \  Yt\. 

Now  ^fc[e(xo,<  )g(xojt)RoF  \Yt]  =  E[Lopc(xof)g(xott)Ro?  \Yr\{E[Lop  |Yr])-1  •  Further¬ 
more,  LotT  >  0  a.s.;  therefore 

E[Lo?  \Yt\~1  <  oo  a.s. 

Consequently,  dividing  by  E[Lqj \Yt\  we  have 

EhHxQjt  )g{x Ot  )£ojr  |  Frl 

=  Ek  [(c*  (x0it  )g(x0it )  +  c{xojt  )gx  (z0>*  ))D0,t  M0,t  | Yt] 

+  E\LZj  c(xoj  )y(x0,r )  Mop  |Fr] 

-  Ek[c(xoj  )g(xo j ) Aoj  1  Yt]  ,  (4.11) 

where  both  sides  are  finite  a.s. 

With  this  in  mind,  to  obtain  a  bound  for  the  conditional  expectation  JEfc[c*(zo,*  )|Fr] 
we  would  like  to  take  g  =  MqJ  in  (4.11).  However,  and  Mqj  involve  the  past 
of  the  process  £o,t ,  A),t  and  Moj .  This  difficulty  can  be  circumvented  by  considering 
an  enlarged  system. 


NOTATION  4.6.  Let  (w,u>,a,t,x,y, L,U)  denote  the  flow  associated  with  the  sys¬ 
tem  (4.2).  Write  for  the  Jacobian  associated  with  this  flow.  Note  that  among 
the  components  of  D^J  are  the  ‘partial*  Jacobian  ^  =  D*,t  a°d  the  gradient 

aL,$'Vl  Write 

R i?  =  R>f  =  j\D7'.X<  (“))'<K 
<?  =  V  =  £  f'^^D-lXMxM-(D7,lydU 
M?1  =M.,=jr  ['  D~l  Xi(u)Xi(uy  (D-lydu. 

i=  1  Jt 


Then  the  system 


=  (^(°) ,  £>(°) ,  j?(°) ,  ,  A<°> ) 


is  Markov  with  coefficients  in 


5(d  +  p  +  2,<f  +  p  +  2  +  (d  +  p  +  2)3,2<f  +  p  +  2+(d  +  p  +  2)2,2d  +  p  +  2  +  2(<i  +  p  +  2)J,l). 

The  results  of  Theorem  2.5  apply  to  this  system  and  its  flow  <f> M .  Note  that  M,tt  is 
the  predictable  quadratic  variation  of  the  tensor  product  of  R$tt  with  itself.  Write 
for  the  coeflicient  vector  fields  of  the  w*  integrals  in  <f>^ ,  and  D^J  for  the  Jacobian  of 
<t>W .  Also  write 

4‘i  =  /' (»(;)-'  m)<k 

Jo 

and  for  the  predictable  quadratic  variation  of  the  tensor  product  of  with 

R^t  which  we  shall  denote  by 
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Then  define  4>&  =  (*W  ,  DM ,  flW  t  M« )  so  is  a  Markov  process  for  which  the 
stochastic  flow  results  of  Theorem  2.5  hold.  Proceeding  in  this  way  we  inductively 
define  X,-n^  for  the  coefficient  vector  fields  of  the  to’  integrals  in  , 

=  /'(£>«->  x<*>  (u))<K, 

JO 

®  jj(o)  )M 

and  ^(n+1)  =  ,!)(*)  ,B(*)  ,A/W‘).  Write  VB  for  the  gradient  operator  in  the  com¬ 

ponents  of  <f>^ . 

THEOREM  4.7.  Suppose  c  is  a  bounded  C°°  scalar  function  on  Rd  with  bounded 
derivatives.  Let  g  be  a  possibly  vector,  (or  matrix),  valued  function  on  the  state  space 
of  <f> M  such  that  g(<f> M  (0,  <))  and  (0,  t))  are  both  in  some  If  (VI).  Then 

Ek[c(x0jt  M*(w)  (0,  t))  ®  4%  I Yt] 

=  Eh [ex  {xqj  )g{4n)  (0,  <))  A*  M0jt  \YT) 

+  Ek[c{xos)vng(<f>W  (0,t))D$M$  \YT) 

+  Ek[L^cMg{<f>{n)  \yT 1 

-  25*[e(xo,*)y(^  (0,i))Ao,r  \Yt]- 

PROOF.  The  result  follows  by  applying  to  the  system  4>^  the  techniques  used  to  derive 
(4.11). 
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REMARKS  4.8.  Theorem  2.5  implies  supl-Di,  |>  suplAfiy  |,  sup [ |,  sup|AoI#|  are 

s<t  ’  »<t  ’  *<t  OX°  t<t 

in  IS(n  x  n,P  x  P)  for  all  1  <  p  <  oo  and,  therfore,  finite  a.s.  .  We  have  already 


noted  that 


sup|D0  }|  and  sup  L0\ 

*<t  *<t 

are  in  every  IP  (fl  x  fl,P  x  P),  1  <  p  <  oo.  To  write  out  the  above  result  in  terms  of 
r)  T 

Do  t »  —TS?1*'  and  higher  derivatives  involves  very  involved  calcuations.  Even  in  the  one 


dimensional  case  it  seems  better  to  introduce  the  sequence  of  flows  <f>^ .  Theorem  4.7 
can  again  be  thought  of  as  giving  a  conditional  ‘integration  by  parts’  formula  for  cx. 


COROLLARY  4.9.  If  M0p  is  in  some  IP (fl  x  fl,  P/,)  ta king  (0,t))  =  M0}  D0}  in 

Theorem  4.7  we  have 

*Mc,(x0lt)|Yr]  =  Ek[c{x0,t)MoJ  Dj}  ®  Roj  |yr] 

-  Pfc[c(x0it)(v1<7)(£>0li ,  M04  )D$  | YT] 

-  Eh[c(zot  )LoJr  Mo}  Do}  Af0^  \YT] 

+  Eh[c(xojt  )Mq}  Dq}  Ao,t  |ir]- 

Because  the  remaining  terms  axe  integrable  and,  therefore,  finite  a.s.  we  have  proved 
the  following  result: 


THEOREM  4.10.  Suppose  P*  is  the  probability  measure  of  DeGnition  3.8  and  (xo,*  ,yo,t ) 
is  the  solution  under  P*  of  (3.10).  Let  e  be  a  smooth  function  with  bounded  derivatives 


of  all  orders.  Then  if  is  in  some  IP  (fl  x  fl,  P*) 

|E(cz(xo^)|Vrll  <  ^(y)  sup  |c(x)| 


(4.12) 


where  K(y)  is  Yr  -measurable  and  Gnite  a.s. 


REMARKS  4.11.  Condition  (4.12)  implies  that  the  random  variable  xott(xo,yo)  has  a 
conditional  density  given  Yt,  d(x),  xG  Rd  for  almost  ail  y.  (See  Malliavin  [6]  or  Zakai 
[9]).  Now  for  any  s  <T 

Yt  c  Yr. 

so  by  Jensen’s  inequality,  from  (4.12) 

\E[cx(x0,t)\Yt}\  <  K'(y)  sup  |c(z)|.  (4.13) 

Equation  (4.13)  holds  for  s  <  t  or  s  >  t  so  the  prediction,  filtering  and  smoothing 
problems  for  the  random  variable  xoj(xQ,yo)  all  have  a  density  for  almost  all  y. 

The  remaining  question  concerns  the  existence  and  integrability  properties  of 
Mq  j-  .  These  have  been  carefully  studied,  see  Bismut  [lj,  Malliavin  (6]  and  Stroock 
[8].  For  ( x,y )  G  Rd  x  Rp  write  T**  for  the  vector  subspace  of  Rd  generated  by  the 
vector  fields  X\  (x,y), . . .  ,Xm(x,y),  and  the  Lie  brackets  of  X\  (x,y), . . .  ,Xm{x,y)  and 
Xi{x,y),...,Xn(x,y),  where  each  bracket  contains  at  least  one  of  the  vector  fields 
X\{x,y), . . .  ,Xm(x,y).  Then  in  Theorem  1.19  of  [2]  Bismut  and  Michel  show  that  for 
all  T  >  0,  is  in  I/(fl  x  fi,P*)  for  all  1  <  p  <  oo  if  the  following  condition  H  of 

Hormander  is  satisfied: 

H  :  Tzqm  is  equal  to  the  whole  of  Rd. 

As  Bismut  [1]  observes,  if  H  is  satisfied  at  (zo,yo)  then  it  is  satisfied  in  some  neigh¬ 
bourhood  of  (zo.yo).  Condition  H  is  a  local  statement  and  translates  immediately  to 
diffusions  defined  on  manifolds. 
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Finally  recall  that  if  u  is  a  non-singular  linear  map  of  Rd  to  itself,  then  the  map 
4>:u~*  u-1  has  a  derivative  <f>'{u)  which  is  a  linear  map  on  the  space  of  linear  maps  of 


Rd  to  itself  given  by  ^'(u) -h  =  — u  1  -h-u  1 .  Applying  this  to  g(Do,t ,  Afo,t )  =  D0J 
we  have 

Eh[cz{xo,t)\YT\  =  Ek[c{xq^)Mq^ Dqj  ®  Rojr  \Yt\ 

-  Bk ((v,  )(Z>#  MjJ  )M0-/  Co",1  |JV] 

-  Ek[£(xo^)M0-;‘  DjJ  ((v.JOo,,  )(£>«  mJ1  )Do}  |W] 

-  Eh  («(**,  )Ljj.  M0~;  fl0-j  |Vr] 

+  A0>r  |Kt]. 


(414) 


5.  BOUNDS  FOR  HIGHER  DERIVATIVES 


To  show  the  conditional  density  of  Xoj  is  differentiable,  in  the  prediction,  filtering 
and  smoothing  situations,  we  shall  obtain  bounds  for  higher  derivatives  of  the  form: 


i£[S(zo^)lFr]l  ^  K(y)  SUP  lc(x)l- 

°X  x€pd 


(5.1) 


where  0  <  t  <  T.  Here  a  =  (ai , . . . ,  £*<*)  is  a  multi-index  of  non-negative  integers  and 


d° 


dai  8°* 


d<*d 


dxa  dzf 1  dx J3  ' ' '  dx  f  ' 

Again  if  0  <  8  <  T  then  Jensen’s  inequality  applied  to  (5.1)  gives 


l‘E'[|^(Xo’t)ly‘]l  ~  ^(y)  lc(x)l- 


(5.2) 


A  well  known  argument  from  harmonic  analysis,  (see  [7],  or  [8]),  shows  that  if  (5.2)  is 

true  for  all  a  with  |a|  =  ai  +  ajd - l-aa<n  where  n  >  d+1  then  the  random  variable 

xo,t  (zo,Sto)  has  a  conditional  density  d(x)  given  Y$,  which  is  in  Cn~d~l  (Rd).  That  is, 
we  have  a  differentiable  conditional  density  in  the  prediction,  filtering  and  smoothing 
situations. 

To  see  how  to  proceed  apply  Corollary  4.9  to  cx  in  place  of  c.  (If  preferred, 
Corollary  4.9  could  be  applied  to  just  one  partial  derivative  in  place  of  c.  However, 
the  result  is  true  for  vector  functions  c).  This  gives: 


Eh[czx  (*o,t) \Yt\  =  Eh[cx(xott  )M0“/  Do}  ®  -So,r \Yt] 
-  Ek (c* (zo^ ) (v j y ) (Z?0,t , Mo,t  )D$  |Kr] 

—  E\  \cz  (xqj  )L>0p  M0j  Dq  }  Mo,t  | Yt) 

+  Eh[cz(xo<t)Mott  Dot  Ao ,t  |Fr|. 


(5.3) 


Consider  the  four  terms  on  the  right  of  (5.3).  Each  term  is  of  the  form 


Ek[cx(x (0 (0,r))|Kr|,  i  =  1,2, 3, 4. 

For  each  such  /i,  consider  a  function 

h{  =  A»Af0(/  D0  j 

and  apply  Theorem  4.7  to  c  and  h{  to  obtain 

Eh [c«  (x0|<  )hi (*«  (0,  t) ,  4>V  (o,  T)) | YT] 

=  EH[c(x0,t)hi(<t>{l)  (0 ,t),^(1)  (0 ,T))  ®  4%  l^r) 

-  Eh[c(xo,t)(vn(t)hi){<t>M  (0,t),^>  (0 ,T))Dg  |yr] 

-  (o,o,^(1)  (0  ,T))D$M$  \Yt] 

-  Eh[L^c(xo,t)hi(<f>{1)  (0,0, ^(1)  (°»r))^f  |yr] 

+  £*(*(*<>,*  )M*(1)  (0,0, ^(1)  (0, T))Aop  \YT]. 

Substituting  in  (5.3)  we  obtain  am  expression  on  the  right  which  involves  only  c  and 
not  its  derivatives.  This  procedure  cam  be  repeated,  using  Theorem  4.7.  At  each  stage, 
to  replace  a  term  of  the  form  Eh[cx(xott  )h(<f>(n)  (0,0,  (0,  T*)) IVt*!  by  one  involving 

only  c  define  h  =  D$J  and  applying  Theorem  4.7.  Clearly  higher  powers  of  Mq  } 

are  introduced  at  each  iteration.  However,  Hormander’s  condition  H  is  sufficient  to 
ensure  that  MqI  is  in  every  7/(0  x  0,  P*),  1  <  p  <  oo.  We  have,  therefore,  proved  the 
following  result: 
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4  V* 


THEOREM  5.1.  Suppose  Hormander’s  condition  H  is  satisfied.  Then  the  random  vari¬ 
able  Z0't  (z0,yQ),  the  solution  of  the  signal  process,  has  a  conditional  density  given  Yt 
for  almost  all  y  which  is  in  C°°(Rd)  for  s  >t  and  s  <  t.  That  is,  under  condition  H 
the  prediction,  filtering  and  smoothing  problems  have  a  smooth  density  solution. 
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